ERGODIC JACOBI MATRICES AND CONFORMAL 

MAPS 



INJO HUR AND CHRISTIAN REMLING 

Abstract. We study structural properties of the Lyapunov ex- 
ponent 7 and the density of states k for ergodic (or just invariant) 
Jacobi matrices in a general framework. In this analysis, a central 
role is played by the function w — —7 + ink as a conformal map 
between certain domains. This idea goes back to Marchenko and 
Ostrovskii, who used this device in their analysis of the periodic 
problem. 



1. Introduction and basic setup 

1.1. Introduction. In this paper, we present a general abstract anal- 
ysis of the basic quantities that are commonly used in the spectral the- 
ory of ergodic spaces of Jacobi matrices. Our original inspiration came 
from the work of Marchenko-Ostrovskii on periodic Schrodinger oper- 
ators [15], which is perhaps best known (definitely to us) through the 
reinterpretation of this material that was given in [8, 9]. Marchenko- 
Ostrovskii use certain conformal maps to parametrize periodic prob- 
lems, and the same device can be used in a much more general setting. 
This is one of the main themes of the present paper. 

What we do here has some overlap with earlier work on the direct and 
inverse spectral theory of ergodic and invariant Jacobi matrices, most 
notably with the by now classical contributions of Kotani [5, 13, 14]. 
So some parts of this paper are expository in character. Rather than 
focus exclusively on those parts that (we believe) are new, we have 
attempted to give a unified, coherent presentation that starts almost 
from scratch. In those parts where the results are not new, we usually 
propose alternative arguments. 
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1.2. Basic setup. Let us now try to give a somewhat more detailed 
description of what we will do here. Recall that a Jacobi matrix is a 
difference operator on u G £ 2 of the form 

(Ju) n = a n u n+ i + a„_iw„_i + b 

Alternatively, one can represent J by the following tridiagonal matrix 
with respect to the standard basis of £ 2 (Z): 



J 



\ 



d-2 b-i a_i 

a_i b a 
a h 



ai 



V 
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Here, a n > and 6 n e R, and we also assume that a,b £ £°°(Z). 
Under these assumptions, J is a bounded self-adjoint operator on £ 2 (Z). 
(One often insists that a n > 0, but for what we want to do here, our 
convention works better.) 

We will also impose a uniform bound on the operator norm, and 
we will in fact work with specifically the space J 2 of all such Jacobi 
matrices J that satisfy ||J|| < 2; an arbitrary bounded Jacobi matrix 
will of course lie in J 2 after multiplication by a suitable constant. It 
is often useful to make J 2 a compact metric space; one possible choice 
for such a metric is 



(1.1) 



d(J,j') = Y,^(\a n -a' n \ + \b n -b' n \) 



The topology induced by d may be described as the product topology 
on now thought of as a subspace of the product of the intervals 
[0, 2] and [—2, 2] from which we draw the coefficients a n and b n , respec- 
tively. Alternatively, this topology is also the one induced by both the 
weak and the strong operator topologies, and we now think of J 2 as a 
subspace of B(£ 2 ), the bounded operators on the Hilbert space £ 2 (Z). 

The shift S(a,b) n = (a,6)„ +1 acts as a homeomorphism on (J 2) d). 
Given an S invariant probability (Borel) measure jj, on J 2l we intro- 



duce a w function w 



w, 



as follows. We average the spectral measures 



dpo(t; J) = d\\Ej(t)So\\ 2 with respect to /i to obtain the density of states 
measure dk: More precisely, the map / (->■ J d[i(J) J dp (t; J) fit) 
defines a positive linear functional on the continuous functions / on 
[—2,2], so there exists a unique (probability) measure dk on the Borel 
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sets of [—2, 2] so that 

(1.2) / dn(J) [ dpo(t;J)f(t)= [ f(t)dk(t) 

Jj 2 J [-2,2] J [-2,2] 

for all / G C[— 2,2]. It's easy to see that J 4 J f (t) dp (t; J) is a 
continuous map on J" 2 for fixed / G C[— 2,2]; we will discuss this in 
more detail in the proof Lemma 2.2 below. In particular, this function 
is measurable and thus the left-hand side of (1.2) is well defined. 
We also define A > by writing 

In do (J) d/i(J) = In A, 

at least if J r In dp > — oo. For easier reference, we introduce the 
notation JH for the set of (S invariant, probability) Borel measures \i 
on J 2 that satisfy this additional condition. We then set 

(1.3) w(z) = lnA- [ \n(t - z) dk(t), 

J [-2,2] 

for z G C + , the upper half plane in C. Here we take the logarithm 
with Im \n( G (— 7r, 0) for ( G C - . So in particular w is a Herglotz 
function (a holomorphic function w : C + — > C + ). The harmonic (on 
C + ) function 7(2) = —Re w(z) is called the Lyapunov exponent. 

These are, of course, well known quantities for ergodic systems of 
Jacobi matrices, extended here in an obvious way to measures \i that 
are just invariant. These quantities are often defined in different ways, 
and indeed there are quite a few well known alternative methods to 
introduce w. See [6, 17, 31] for (much) more on these topics. Definition 
(1.3) is straightforward and convenient for our purposes. 

1.3. Overview of main themes. As already announced, one of the 
recurring themes of this paper will be the generalized version of the 
observation of Marchenko and Ostrovskii that w maps C + conformally 
onto an image domain w(C + ) of a certain type, and, conversely, these 
domains can be used to reconstruct w, A, and dk (in fact, this is 
not literally true; it becomes true after a suitable change of variables, 
as we'll discuss below). See the discussion of Section 2, especially 
Proposition 2.4. A variety of other data are available, and we study 
the relations between these in some detail in Sections 3 and 4. Section 
5 contains one of the main results of this paper, Theorem 5.4: Given 
suitable data (for example, given a w function), we can find an invariant 
measure \i G M.q that will produce these data. A less complete result 
of this type was proved earlier in [5]. 
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A new topic is introduced in Section 6. Here we show that the 
correspondence between the gaps of the spectrum and slits of the image 
domain that is one of the cornerstones of the Marchenko-Ostrovskii 
method (and obvious in the original setting) extends to the general case, 
if suitable definitions are made. In Section 7, we study the Lyapunov 
exponent as a function on [—2, 2] (rather than as a harmonic function 



Sections 2-7 form the main part of this paper. The final three sec- 
tions are lighter in tone. In Section 8, we revisit work of Avila-Damanik 
[2] on the positivity of generic Lyapunov exponents from the point of 
view suggested by the material of this paper. Section 9 offers a brief 
discussion of the possibility of finding ergodic (and not just invariant) 
measures //, but what we have to say here does not really go beyond 
the work of Kotani [13], and we present more questions than answers. 
In the final section, we give an easy argument for the invariance of w(z) 
under a class of transformations that includes all Toda flows. 

2. Basic objects 
Given \x G .Mo, define the corresponding w as described above. Write 
w(z) = -7(2) + ink (z); 
notice that < k < 1. Also, let 



be the increasing function that generates the density of states measure 



on C+). 




dk. 



Proposition 2.1. (a) Let 




zeC+ 



Then k is continuous on C + U R. 
(b) The limit 

7(2;) := lim 7(2; + iy) 



exists for all igK. Moreover, 7(2) > on z e C + . 
(c) (Thouless formula) For all z E C + U R, 




In \t — z\ dk(t) 
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These properties are well known for ergodic measures fi G M.®. See, 
for example, [31, Chapter 5]. A discussion of these issues for measures 
\i that are just invariant may be found in [5]. 

Sketch of proof. Perhaps the most interesting part of this proof is the 
one where we establish the inequality 7 > on C + ; once this is avail- 
able, everything else will then fall into place very quickly or at least 
follow from routine arguments. Let us first sketch how this can be 
done, assuming, for the moment, the inequality 7 > 0. 

Indeed, part (c) for z G C + is of course an immediate consequence 
of the definitions of w and 7. Existence of the limit from part (b) can 
then be deduced from (c) by splitting the region of integration into 
the two parts \t — x\ < 1 and \t — x\ > 1 and using monotone and 
dominated convergence, respectively. These considerations also extend 
the validity of (c) to z G R. 

Next, we observe that the inequality 7 > together with the Thou- 
less formula force dk to be continuous measure; equivalently, ki(t) is a 
continuous function on R. 



the limit exists for (Lebesgue) almost every t G R. Since ko(z) is 
bounded, the Herglotz representation of w(z) reads 



In fact, as Im w(z) < ti on C + , we must have D = here. By dif- 
ferentiating (1.3), we obtain that w'(z) = J so Im w\z) > 0, 
or, equivalently, dk (x + iy)/dx > on C + . This implies that k (t) 
is an increasing function on R. Originally, we could only guarantee 
that k (t) was defined off a null set iV C R, but now we can put 
ko(s) = lim t _>. s _ ;t ^iv ko(t) for s G iV to obtain an everywhere (on R) 
defined increasing function fc . It is also clear, by direct inspection of 
(1.3), that k (t) = for t < -2 and k (t) = 1 for t > 2. Thus k 
generates a probability measure dk on [—2, 2], and now an integration 



Define 



(2.1) 



k (t) = lim k (t + iy); 



(2.2) 
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by parts lets us rewrite (2.2) as follows: 
(2.3) w(z) = C + 



oo g 



\n{t-z) - -ln(t 2 + 1) 



k Q (t) dt 



t — z 

Co + lim kp(t) In . 



t=R 



t=-R 



ln(t - z) - -ln(t 2 + 1) 



dk (t) 



/oo 
\n(t-z) dk (t). 
-oo 



Measures in Herglotz representations are unique and we can again con- 
sider w', so it follows from this that dk = dk±. As already observed 
above, k\ is a continuous function on R, and hence so is ko(t) = k\{t). 
Moreover, we defined ko(t), in (2.1), as the boundary value, Lebesgue 
almost everywhere, of the bounded harmonic function ko(z), z G C + . 
The Poisson representation formula now shows that k = k is continu- 
ous on C + U K, as claimed. 

So, as promised, it only remains to show that 7 > 0. We will in 
fact assume this inequality for ergodic fi. This is well known; in the 
ergodic case, 7 can be related to the exponential decay rate of certain 
solutions to the difference equation Ju = zu (thus the term Lyapunov 
exponent). See [31, Chapter 5]. So we will only explain how to gener- 
alize the inequality to invariant yU. As mentioned above, this issue is 
also discussed in [5]; we offer an easy alternative argument here. 

Let 



(2.4) 



Fe{J) 



1 



1 + e 



'J + eJ ), 



where Jo is the Jacobi matrix with a n = 1, b n = 0. In other words, 
we essentially add e to all a's; the denominator 1 + e is not essential 
and is only introduced to make sure that F e (J) G J2 again. Given an 
invariant measure /x, let \i t = F e fi be the corresponding image measure; 
in other words, J f dfi e = |/oF e dfi. 

Then fi t is an invariant measure on the compact subspace 

J® = {J ej 2 : a n > j-i— for all n G Z} 

of J2. Since the ergodic measures are the extreme points of the set 
of invariant measures, there are convex combinations jif^ of ergodic 
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measures Vj, n ,e on J/ 2 , 



„(n)=Vc- I/- 



so that — > // e in weak-* sense as n — >■ oo. By the result for ergodic 
measures, we do have that 7j, n ,e > for the corresponding Lyapunov 
exponents, and since 7„ depends linearly on u, it also follows that 
7e™' ) > 0. Now on J^ € \ the function J h-> In a (J) is continuous, so 
ln^™' 1 — )■ In A e as n — > oo. 

The integrals from the Thouless formula will also converge. To see 
this, we make use of the following simple fact. 

Lemma 2.2. Suppose that p n — > p in weak-* sense. Then also dk n — > 
dk. 

The situation we have in mind here of course includes the assumption 
that /i n , At G .Mo) but the Lemma is also valid, with the same proof, 
for arbitrary finite measures. 

Proof of Lemma 2.2. Let / G C[-2,2]. Then, from (1.2), 



/ f(t)dk(t)= [ dp(J) [ dp (t;J)f(t) 

J JJ-2 J [-2,2] 

= lim / dp n (J) / dp (t; J) f(t) 



lj 2 J [-2,2] 

lim / f(t)dk n (t) 



because J \-> J f(t) dp (t; J) is a continuous function on J 2 . To confirm 
this last claim, it suffices to observe that convergence with respect to d 
is equivalent to strong operator convergence and this, in turn, implies 
weak-* convergence of the spectral measures po- □ 

Thus we now know that ^^(z) — > ^(z) on z G C + . In particular, it 
follows that 7 e > there. 

From the definition of p e and dominated convergence, it is also clear 
that p t — > p in weak-* sense as e -)■ 0+. Hence, as just observed, the 
integrals from the Thouless formula approach the corresponding limit 
as e — > 0+. Finally, monotone convergence shows that 

lnA e = J lna (J)dp e {J) = J In a (F e ( J)) dp{ J) 

= J i n Q °|^ 6 dp( J) -» Jlnao(J) dp(J) = \nA. 
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Hence also 7 e — > 7, so 7 > 0. The harmonic function 7 is clearly not 
equal to a constant, hence cannot assume a minimum value, and thus 
in fact 7 > on C + . □ 

It is also useful to notice the following well known consequence of 
basic potential theory at an early stage: 

Lemma 2.3. A < 1 for any /j, G M.q. 

Proof. Integrate the Thouless formula with respect to dk. Since 7 > 0, 
we obtain that 

0<-\nA + J J \n\t- x\dk{t) dk{x). 

By the definition of logarithmic capacity [22, Definition 5.1.1], the dou- 
ble integral is < In cap [—2, 2] = 0. □ 

This argument also shows that if A = 1, then dk = du>[-2,2], the 
equilibrium measure of [—2, 2]. From this one quickly obtains the well 
known uniqueness result that fi = Sj if A — 1. We don't want to give 
any details here, but see Proposition 9.2 below and its discussion for 
a more general argument of this type. The material from [23] is also 
closely related to these issues. 

We already mentioned several times the fact that w provides a con- 
formal map from C + onto its image. It is advantageous not to work 
with w itself but with a related function that is obtained by changing 
variables, as follows. Notice that 

(2-5) C H- z = z(C) = -C - ^ 

defines a conformal map from the upper semidisk D + — DC] C + onto 
C + . Here, D = {z : \z\ < 1} denotes the unit disk. We can therefore 
introduce 

F:D + ^D + , F{()=e w ^ )) . 
F indeed maps to the upper unit disk because Re w < 0, < Im w < ir. 

Proposition 2.4. F has a holomorphic extension to D, by reflection: 
F(Q = F((). This extended function F is a conformal map from D 
onto F(D) C D, with F(0) = 0, F'(0) = A. 

Since, at least in general, there is some potential for confusion as- 
sociated with this terminology, we should perhaps clarify our use of 
language here: by a conformal map (also known as a biholomorphic 
map) we mean a holomorphic bijection between connected open sets 
(also called regions or domains); in fact, all domains in this paper will 
be simply connected. 
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Proof. It's easy to check that if ( n G D + , ( n — > x G (—1, 1), x ^ 0, then 
Im F(( n ) — >■ 0. Indeed, if — 1 < rr < 0, say, then z n = —( n — l/( n — > 
t > 2, and thus fc(z n ) — >■ 1, by Proposition 2.1(a). Since F = e~ J e l7Tk , 
this gives the claim in this case. In fact, part (c) of the Proposition 
shows us that 7 is continuous near such a t, so F actually approaches 
a negative limit. The case < x < 1 is similar; this time, F converges 
to a positive limit. 

The Schwarz reflection principle therefore provides a holomorphic 
extension of F to D \ {0}. To define F for ( G D~ , we refer to the 
identity ("reflection") F(£) = F((). Moreover, if C G £>+, C ->■ 0, then 
z = — ( — 1/C satisfies |z| — > 00, so 

w(z) = In A - J ln(t - z) dk(t) = - \n(-z) + In A + 0(l/z) 

as C ->• and this leads to F(() = A( + 0(C 2 )- It follows that the 
singularity at ( = is removable and F'(0) = A, as claimed. 

Finally, notice that w is a conformal map from C + onto its image. 
This simply follows from the fact that Im w'(z) > on C + , which we 
already observed (and used) in the proof of Proposition 2.1. It now 
becomes clear that F also maps D + injectively onto a subset of D + 
and D~ in the same way onto the corresponding reflected subset of 
D~ . Moreover, as we observed above, F(I) C / for both I = (—1,0) 
and / = (0, 1). Thus F could fail to be injective only if F(xi) = F(x2) 
for some points X\,X2 that are either both in (—1,0) or both in (0, 1). 
However, it's easy to confirm that 7(— x — 1/x) is strictly increasing 
and decreasing, respectively, on these intervals. Hence F is a conformal 
map, as claimed. □ 

We remark in passing that the Schwarz Lemma now provides another 
simple proof of Lemma 2.3. 

Proposition 2.5. (a) The domain Q := F(D) C D is of the following 
type: If Re ia G tt, then re ta G Q for all r < R. Also, re ia G Q if and 
only if re' 101 G Q. 

(b) A subset Q C D, ^ is open and has the properties stated in 
part (a) if and only if there exists an upper semicontinuous function 
h: S 1 ->■ [0, 1), with h{e~ ia ) = h{e ia ), so that 

n = n h = {re ia : < r < 1 - h(e ia )}. 

In other words, Q is the unit disk with radial slits 

S a = {re ta : 1 - h{e ia ) < r < 1} 

removed; the function h(e ia ) records the height of the slit at angle a. 
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Proof, (a) We first discuss the corresponding claim about the region 
w(C + ) C {u + iv : u < 0, < v < 7r}. Fix v and put 

L v = {ueR:u + iv E w(C + )}. 

We want to show that L v = (—oo,uq(v)). If this were not true, then 
either L v = 0, or there is an interval (a, b) C L v , with a,b L v and 
a < b < 0. This follows because L v is open, so if this set is non-empty 
and not just a half-line, then we can take some other component, which 
will necessarily be bounded. Now L v = is clearly impossible because 
k takes values arbitrarily close to and also other values that come 
arbitrarily close to 1, and w(C + ) is connected. 

Take preimages, that is, write u + iv = w(z(u)) for a < u < b, 
and with z(u) G C + . Clearly, z(u) = x(u) + iy(u) is a continuous 
function of u G (a, b). Moreover, y(u) is injective. This follows because 
Im w'(z) > 0, as we observed above, so 

dk{x + iy) 
dx 

Hence it is not possible for two points Z\ , z<i with the same imaginary 
part to have images w(zi),w(z 2 ) whose imaginary parts agree also. 

So y(u) must be monotone, and in fact it's not hard to check that y{u) 
is strictly decreasing on (a, b) (but we don't really need to know this 
here since an analogous argument would work for strictly increasing 
y(u)). Notice also that the z(u) stay inside a bounded set, because 
7(2) — > 00 as \z\ —> 00. Thus, on a suitable sequence u n — > a, we have 
that z(u n ) — >■ z = x + iy, and here y > 0. It follows that a + iv = 
w(z) G w(C + ), but this contradicts our choice of a. 

By transforming back to F and Q, we now obtain the first property of 
fl for < a < iv, and, by reflection, also for — ir < a < 0. Here, we have 
already made use of the invariance of Q under reflection about the real 
line, but this second property is really obvious from the corresponding 
symmetry of F. 

Next, consider a — 0. If we recall our discussion of the mapping 
properties of F from the proof of Proposition 2.4, then we see that 
the positive values of F(() come from the ( G (0, 1). For these £, the 
variable z = x = —( — l/£ varies over (—00, —2), so 

(2.7) n n (0, 1) = {e~^ : x < -2}. 

The Thouless formula (Proposition 2.1(c)) shows that 7(2;) is strictly 
decreasing 011K —2, and 7(2;) — > 00 as x — > —00, so this set is a ray 
(0, R), as claimed. The argument for a = n is, of course, analogous. 



(2.6) 
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(b) Any domain fl with the properties just established is equal to a 
domain fl h , if we simply define 



Furthermore, it is also clear that only this choice of h can possibly work 
if it is our goal to represent a given fl as an flh for some h. 

Conversely, given any function h : S 1 — >■ [0, 1), we can form the set 
flh- This set will always contain 0. It is open if and only if h is upper 
semicontinuous, and it is invariant under reflection about the real line 
if and only if h is symmetric. Thus, given fl as described in part 
(a), h defined by (2.8) has these properties. Conversely, if an upper 
semicontinuous, symmetric h is given, then flh will be as described in 



We are now in a position to appreciate why it was useful to change 
variables and work with F and fl = F(D) C D rather than w and 
w(C + ) C S = {x + iy : x < 0,0 < y < n}. Since always F(0) = 0, 
-F'(O) > 0, the conformal map F can be reconstructed, at least in 
principle, from its image fl = F(D). This is not true for w. Indeed, if 
fj, = 5aj , where Jo denotes the free Jacobi matrix a n = 1, b n = 0, then 
wa(z) = w (z/A), and w maps C + onto the full strip S. This latter 
statement follows easily without any calculation from simple properties 
of dk and 7 for the free Jacobi matrix J , but one can also use the 
explicit formula 



instead. Here, we would have to clarify the precise definitions of the 
logarithm and the square root, but in fact a much more transparent 
formulation is obtained if we just say that -Fo(C) = C 

So w A (C + ) = S for all < A < 1, and the image under w does not 
distinguish between these w functions. The domains Ha C D, on the 
other hand, have slits at a = 0, ir of A dependent heights, so are not 
equal to one another. One can verify directly that these slits become 
invisible if we transform back to w and z. Theorem 6.1 below will 
throw some additional light on this issue. 

The slit height function h is closely related to the Lyapunov exponent 
7. In fact, it is essentially 7, plus the change of variables F = e w , 
a = nk(t). 

Theorem 2.6. For < a < ir, we have that 



(2.8) 



h(e ia ) := sup{r > : (1 - r)e ta £ fl}. 



(a). 



□ 




h(e ia ) = l-e- L(a \ 
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where 

L(a) = sup{7(t) : -2 < t < 2,nk(t) = a}. 

If t G E — top supp dk and t is not an endpoint of a component 
(a, b) C (—2, 2) \ E, then there is no s ^ t with k(s) = k(t) and thus 
for these t, the formula above takes the simpler form 

h(e™ k V) = l-e-^. 

Recall in this context that top supp dk, the topological support of dk, 
is defined as the smallest closed subset £cl with k(E c ) = 0. 

Also, the set fc _1 ({a:/7r}) n [—2, 2] is either a single point or a closed 
interval [a, 6], because k(t) is increasing and continuous. In the second 
case, the interior (a, b) is a component of (—2, 2) \ E. 

Proof. It is again more convenient to discuss the analogous claim about 
the region w(C + ). So, for < v < 1, define 

H(v) = sup{w > : — u + iirv ^ w(C + )}. 

We want to show that 

(2.9) H(v) = L{v). 

Now for any t G (—2, 2) with k(t) = v, we certainly have that —~f(t) + 
inv w(C + ). Indeed, if — j(t) + iirv = w(z ) for some z G C + , then, 
by open mapping, the image of a small disk D r (z ) under w would 
include a disk about —7(2) +inv , but at least some of these points also 
occur as images of t + iy for small y > 0, and this contradicts the fact 
that w is injective. Thus H(v) > L(v). 

On the other hand, if we had that H{v) > L(v), say 

(2.10) H(v)>-y(t) + e 

for all t G (—2, 2) with k(t) = v, then we can again look at the preim- 
ages of —u + inv for u > H(v). As in the proof of Proposition 2.5, write 
—u + inv = w(z(u)). We now let u approach H(v). As above, the z(u) 
will stay inside a bounded set, so will converge to a limit to £ C + U K. 
along a suitable subsequence. In fact, to £ C + is impossible here be- 
cause then —H(v) + mv = w(t ) would lie in w(C + ). Thus t G R. 
Since k is continuous on C + U R, we can conclude that k(t ) = v, and 
now (2.10) demands that 7(^0) < H{y) — e. The function 7 is upper 
semicontinuous, so this inequality would prevent u = 7(2(14)) from ap- 
proaching H(v) when we send u — > i?(i>) along the subsequence chosen 
above. We can escape this absurd situation only by abandoning (2.10). 
We have established (2.9). 
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This gives the Theorem for a 7^ 0, ir. The remaining cases a = 0, tt 
do not pose any problems; it suffices to refer to what we discussed 
already above. See especially (2.7). □ 

3. Data sets 

Let us summarize: Starting out from an invariant measure \i G M.q 
on J 2 , we introduced the density of states dk as the average of the 
spectral measures p an d hiA — f \na d/i > — oo. These have the 
property that 

(3.1) -\nA + In \t - z\ dk(t) > for z G C. 

J [-2,2] 

We then introduced a variety of additional data, which were computed 
from (A, dk). We will now show that we can go back and forth between 
these. More precisely, each of the following is determined by and will 
determine (A, dk): 

• the w function w(z) on z E C + ; 

• the Lyapunov exponent 7(2) on z G C + ; 

• the conformal map F : D — >■ D; 

• the image domain f2 = F(D); 

• the slit height function h 

We will also identify the classes of objects obtained in this way. For 
easier reference, we give names to the corresponding sets. 

Definition 3.1. We say that: 

(1) (A,dk) G V (density of states) if A > and dk is a probability 
measure on the Borel sets of [—2,2] and (3.1) holds; 

(2) W G W (w function) if W, W are Herglotz functions, W maps C + 
to the strip S = {x + iy : x < 0,0 < y < 71}, W' extends holomorphi- 
cally to C\[-2, 2] by reflection W'{z) = W'(z) and liim^ yW'(iy) = i; 

(3) T G C (Lyapunov exponent) if T,dT/dy are positive harmonic 
functions on C + , T extends harmonically to C \ [—2, 2] by reflection 
F(z) = F(z), and 

hm -r^- = 1; 

y^oo In y 

(4) G G C (conformal map) if G : D — > Q is a conformal map onto a 
region Q C D of the type described in Proposition 2.5, with G(0) = 0, 
G"(0) > 0; 

(5) Q G 72. (region) if C is a region of the type described in 
Proposition 2.5; 
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(6) g G H (height function) if g : S 1 — > [0, 1) is a symmetric (g(e * a ) = 
g(e ia )) upper semicontinuous function. 

Theorem 3.1. If (A,dk) G V is given, then the associated data w, 
7, F, VL, h have the properties from parts (2)-(6) of Definition 3.1. 
Conversely, if an object of one of these types is given, then there exists 
a unique pair (A, dk) G V that is associated with it. 

At this point, this statement seems to be of conditional type because 
we have not yet shown that every (A, dk) G V is actually obtained from 
an invariant measure \i G A4q, and indeed, we will leave this issue com- 
pletely aside in this section and the next. However, as we will discuss 
later, this statement is true; see Theorem 5.4 below. For now, it will be 
important to observe that nowhere in the developments that started 
with Proposition 2.4 did we use the fact that (A, dk) were obtained 
from a /i G .Mo; rather, it was only property (3.1) that mattered. 
Similarly, Proposition 2.1 continues to hold if we just assume (3.1). 

We again witness the effect that things become particularly trans- 
parent on the level of the conformal maps. Note, for instance, that 
items (2), (3) from Definition 3.1 come with a sizeable amount of fine 
print, and contrast this with the satisfying fact that all symmetric up- 
per semicontinuous functions occur as slit height functions. 

In one part of the proof, we will make use of several classical results 
on conformal maps and their boundary values. This material will also 
be important in subsequent sections, so let us give a brief review now. 

The first tool is the notion of kernel convergence for the image do- 
mains Q. For a careful discussion of this topic in a general setting, 
please see [7, Section 15.4]. We give the basic definition in the form 
most suitable for our purposes here, and specialized to the case that is 
of interest to us. 

Definition 3.2. Let Q n , Q C D be subdomains of the unit disk of the 
type discussed in Proposition 2.5. We say that Q n — > f2 in the sense of 
kernel convergence if: 

(i) If z G Q, then there exist a radius r = r(z) > and an index 
N = N(z) so that D r (z) C Vt n for all n> N. 

(ii) If z ^ and r > are given, then there exists N = N(z, r) so that 
D r (z) is not contained in fl n if n > N. 

To confirm that this is indeed what [7, Definition 15.4.1] says in the 
present context, observe that the kernel with respect to Zq — (as 
defined in [7]) of a sequence of domains of the type J\ n , if it exists, 
is another domain of the type f^. In particular, there is no need to 
take a specific connected component of the set introduced in [7]. The 
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general definition of a kernel also demands that D r (0) C Q n for some 
r > and all large n, but this is a consequence of (i) here because we 
always have that G fl 

This notion is important for us here because kernel convergence of 
the image domains is equivalent to the locally uniform convergence of 
the conformal maps from D onto these domains. We will return to 
these issues shortly, but let us first give a characterization of kernel 
convergence in terms of the associated slit height functions h. 

Lemma 3.2. Let Q n , Q C D be domains of the type discussed in Propo- 
sition 2.5, and let h n ,h be the associated slit height functions. Then 
the following are equivalent: 

(a) Q n Q in the sense of kernel convergence; 

(b) sup9?/i n — > sup ifih for every ip G C^S 1 ), ip > 0. 

Proof. We first verify that (b) implies (a). Let's start with condition 
(i) from Definition 3.2. Fix an arbitrary point z G f^, say z = re ia . 
Then r < 1 — h(e ia ). The case r = is easy: We have that sup h < 1, 
so condition (b) with ip = 1 shows that also sup h n < 1 — 8, uniformly 
in n, for some 5 > 0, and thus D s (0) C Vl hn for all n. So we can now 
assume that < r < 1 — h(e ta ). Since h is upper semicontinuous, 
we will have that h < 1 — r — 2e, say, on a suitable neighborhood of 
e ia , for some e > 0. We can now use (b) with a function (p that is 
supported by this neighborhood, equal to 1 on a smaller neighborhood 
of e ta , and takes values < ip < 1. Assumption (b) then says that for 
all sufficiently large n, we will also have that h n (e l13 ) < 1 — r — e, say, 
uniformly on some neighborhood \/3 — a\ < rj. In particular, this shows 
that D$(z) C Vl hn for all these n, if we take 5 < min{e, r?7/100}, say. 

Let's now move on to condition (ii) from Definition 3.2. We are 
given & z Q h and a radius 5 > 0. The assumption that z = re ia ^ Qh 
means that r > 1 — h(e ia ). Pick a function < ip < 1 that is supported 
by |/3 — a \ < 5/10 and equal to 1 at e ta . Condition (b) then provides 
angles (5 n from this neighborhood so that h n {e l ^ n ) > 1 — r — 5/2 for all 
large n. In particular, this shows that D$(z) is not contained in flh n 
for these n, as desired. This concludes the proof of the implication (b) 
=► (a). 

We now want to show that, conversely, (a) implies (b). Fix ip G 
V 9 ^ 0- We would first like to show that 

liminf (sup <ph n ) > sup iph. 

n-^oo 

The upper semicontinuous function <ph assumes a maximum on the 
compact set S 1 , so sup iph = ip(e ia )h(e %a ) for some e ia G S 1 . We may 
assume here that (p(e ia )h(e ta ) > because otherwise what we're trying 
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to show is trivially true. In fact, for convenience, let's also assume that 
(p(e ia ) = 1. We have that (1 — h(e ia ))e ta £ Q h: and now (ii) from 
Definition 3.2 shows that for any 5 > and all large n > N — N (5), 
we must have h n (e il3 ) > h(e lOL ) — 5 somewhere on a — 5 < (3 < a + 5, 
say. Since ip is continuous, it will satisfy ip > 1 — rj on this interval, 
and here rj > can be made arbitrarily small, provided we start the 
argument with a sufficiently small 5 > 0. Putting things together, we 
conclude that sup iph n > sup iph — 5 — i] for all large n. As discussed, 
5 + r] can be made arbitrarily small here, so this is what we wished to 
show. 

It remains to prove that also 

(3.2) sup <ph > limsup(sup (ph n ). 

n— >oo 

Again, the suprema are really maxima, attained at e tan , say. We can 
now pass to a subsequence on which we converge to the limsup from 
the right-hand side of (3.2), and then pass to a subsequence a second 
time to make the points converge, say a n — > a. If (3.2) were wrong, we 
would have that ip(e ia )h(e ta ) < ip(e ian )h n (e ian ) — e , for some e > 
and all large n from the subsequence that was chosen. Since ip is 
continuous, it would then also follow that 

(3.3) h(e ia ) < h n {e ian ) - e, 

for these n and some new (possibly smaller) discrepancy e > 0. Now 
obviously z Q := (1 — h(e ta ) — e)e ia G but (3.3) says that given 
any radius 5 > 0, no matter how small, the corresponding disk -D^o) 
will not be contained in Qh n for infinitely many choices of n. This 
contradicts condition (i) from Definition 3.2. □ 

The second set of classical results on conformal maps that will play an 
important role here deals with the boundary values of these functions. 
The fundamental result in its general form says that a conformal map 
F : D — > Q extends to a homeomorphism F : D — >■ f2, where Q is 
the union of Q with the collection of its prime ends, endowed with 
a suitable topology. Please see [7, Sections 14.2, 14.3] for a careful 
discussion; the result just mentioned is stated as Theorem 3.4 of [7, 
Section 14.3]. For now, we will need the theory of prime ends only for 
regions of a relatively simple type; later on, in Section 6, prime ends 
will make another appearance. In both cases, the material from [7, 
Sections 14.2, 14.3] will provide more than adequate background. 

After these digressions, we now return to Theorem 3.1. When we 
prove this, one assignment will be the task to construct (A, dk) G 
V, given a region Q G 71. For regions of a certain simple type, this 
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problem admits an explicit solution, and we will base our treatment of 
the general case on this. 

More precisely, call a domain Q G 1Z a finite gap domain if the 
corresponding slit height function h is non-zero only at finitely many 
points. So these are regions with finitely many slits; we call them finite 
gap domains because they correspond to finite gap Jacobi matrices, 
that is, reflectionless Jacobi matrices whose spectrum is a finite gap set 
(a disjoint union of finitely many compact intervals of positive length). 

Lemma 3.3. Suppose that Q G 1Z is a finite gap domain. Then there 
exists a finite gap set E C [—2, 2] so that Q is the region associated 
with A = cap E, dk = duE- 

Here, cap E again denotes the logarithmic capacity of E, and ooe 
is the equilibrium measure of E. Please see [22, 26] for background 
information on potential theory. The proof will show that E can be 
obtained as the inverse image of dD under the (extended) conformal 
map z !->■ F — e w ( z \ 

Note also that J* In \t — z\ duE{t) > In cap E for all z G C for a finite 
gap set E, so (3.1) holds and thus (A, cLje) is an admissible set of data 
from the class V. 

Proof. Let F : D — > Q the unique conformal map onto Q with F(0) = 0, 
F'(0) > 0. It is easy to find the set of prime ends for a finite gap region 
Q. We can conveniently identify this set with a set built from the 
boundary dQ as follows. We use two copies of each slit (minus its 
end point) {re ia : 1 — h(e za ) < r < 1}. Let's call these S + (a) and 
S-(a). Then there is a natural bijection between the prime ends of 
f2 and the union of these S± with the rest of dil. Moreover, using 
this identification, we can also easily describe the topology of Q, the 
union of Q and its prime ends. The topology is in fact the obvious 
one, if we think of Q as the union of Q and its boundary, but with 
each slit having two "sides," and points from one side of a slit are not 
close to those from the other side. More formally, we can say that if 
(re ia , +) G S+(a), say, then a neighborhood base is given by the sets 

U e = {pe if) : \p - r\ < e, a < (3 < a + e} U {(pe ia , +) : \p - r\ < e} 

for small e > 0. Of course, similar descriptions are available at other 
points, but we will leave the matter at that. 

Recall that we know from [7, Theorem 14.3.4] that F extends to a 
homeomorphism F : D — >■ f2. In particular, F maps dD homeomorphi- 
cally onto the prime ends of Q. By mapping the prime ends back to the 
correponding points in the complex plane, we also obtain a continuous 
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map Fq from dD onto dQ (the boundary is now taken as a subset of 
C). This map is not a homeomorphism; every point on a (half-open) 
slit has two preimages. The inverse image of dD under this map F is 
a finite disjoint union of subarcs of dD; the number of subarcs is equal 
to the number of slits. 

We now transform back to a putative W function, using the change 
of variables from Section 2. Observe that since Q is invariant under 
reflection about the real axis, so is F: we have that F(() = F(Q. 
This implies that F(D n R) C D H R, and since F'(0) > 0, we also 
see that F(D + ) C D + , F(D~) C D~ , where we again abbreviate 
D ± = D fl C ± . Thus we can take a holomorphic logarithm on D + and 
define W(z) = lnF((), with < Im W(z) < n for z G C + , and z and 
( are related by (2.5). This function W maps C + conformally onto the 
strip S = {x + iy : x < 0, < y < n} with finitely many horizontal 
slits of the type 

S(y, d) = {x + iy : — d < x < 0} 

removed. What we just said about the boundary behavior of F and F 
translates into similar statements about W. More precisely, W extends 
continuously to the boundary (9C + = R and maps 1R onto the union of 
dS with the slits S(yj, dj). Every point of S(yj, dj) \ {—dj + iyj} has 
two preimages, all other boundary points have one preimage. 

The points 2 G 1 \ [—2,2] correspond to ( G (—1,1), which are 
not in the boundary of the original domain D, but of course that is 
no problem at all because F is holomorphic there and thus definitely 
extends continuously. Somewhat greater care is required to handle 
possible slits at a — 0, ir. Here, we observe that we obtain precisely 
one side of such a slit as the image of F, restricted to D + . This follows 
from the reflection symmetry of F . 

Let 

E = W' 1 ({iy :0<y<n}). 

As explained above, E is a finite gap set; it is the inverse image under 
(2.5) of a finite disjoint union of closed subarcs of dD. Since, under 
(2.5), only the z G [—2,2] produce values ( G dD, we also know that 
EG [-2,2]. 

Next, write W = — / ~f + iirk. Since k is continuous up to the real line, 
the Herglotz representation of this function reads 

Clearly, the fact that W maps to S forces B = 0. Moreover, k is an 
increasing function. To see this, first recall that W maps the interior of 
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E bijectively onto {iy : < y < ^}\{Wj}- Thus k is monotone on each 
interval from E. On the other hand, if (a, b) C E c , then we have to 
map this set under W to the union of the slits and the top and bottom 
parts of dS. As (a, b) is connected, we in fact have to map to a single 
such horizontal segment, and we now see that k is constant on (a, b). 
Putting things together, we conclude that k is monotone on R. Finally, 
arguments t < — 2 correspond to ( G (0, 1), and since F'(0) > 0, these 
get mapped to positive values again under F, hence k(t) = for these 
t. Similarly, k(t) = 1 for t > 2. 

To summarize: k(t) is strictly increasing on the interior of E and 
constant on each component of the complement, and k increases from 
to 1. In particular, k generates a probability measure dk that is 
supported by E. 

We can now run the integration by parts calculation from (2.3) again. 
We obtain that 



This formula was derived for z G C + , but it remains valid for the 
continuous extension of W to z G R because Re W < on C + , and 
now the arguments from the proof of Proposition 2.1 yield (3.4) on 
zGl also. 

Let's take a look at 



From the mapping properties of W, we know that $ = C on E, the 
support of dk, but $ < C on C \ E. These properties identify $ as 
the equilibrium potential of the set E (so dk = cLoe) and e c as the 
logarithmic capacity of E; see [26, Theorem 1.3.1] and also Remark 1.5 
from Section 1.1 of this reference. 

So if we use these data (A, dk) = (cap E, du E ) G V as our input, then 
we will obtain the finite gap domain Q G 1Z we started out with. □ 

We are now ready for the 

Proof of Theorem 3.1. We will focus on the existence part ( "Conversely, 
...") exclusively. Indeed, except for small details, which we leave to 
the reader to fill in, our discussion from Section 2 has already shown 
that the data we introduced have the stated properties. As mentioned 
above, it is important to note here that our arguments only used (3.1); 
it was not essential that in the original setting, (A, dk) were obtained 
from a measure /i G M - It is also easy to see that each of the data 
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from Definition 3.1 determines (A,dk), so we will not spend any time 
on uniqueness, either. 

With these preliminaries out of the way, suppose now that aW G W 
is given. We want to construct (A, dk) G V so that 



W{z) = In A - J \n(t - z) dk(t). 



The properties of W in particular ensure that W'(x) = W'(x) for 
x G R \ [—2,2], that is, W is real at these points. Therefore, the 
Herglotz representation of W' takes the form 

dk(t) 



(3.5) W\z) = C + Dz + J 



2.2 



t-z' 



with a finite measure dk and C G R, D > 0. In fact, the asymptotics 
of W immediately imply that C = D = 0, dk(M.) = 1. Thus indeed 



w(z) = B ~ J Ht-z) dk (t)- 



As usual, we take the logarithm with imaginary part in (0, tt) here. By 
assumption < Im W < 7r on C + , and we can now consider W(Re ia ) 
with < a < tt and large R > to conclude that Im B = 0. In other 
words, we can indeed write B = In A for some A > 0, and since also 
Re W < by assumption, it then follows that (A, dk) satisfy condition 
(3.1), as required. 

Assume now that we are given a function r G C The argument, 
unsurprisingly, will be quite similar to what we just did. Introduce 
W(z) = — T(z) +inK(z), where ttK is a harmonic conjugate of — r on 
C + . This determines K up to a constant, which will be irrelevant here 
and can be chosen arbitrarily. The Cauchy-Riemann equations show 
that 

t tx,// • \ dK(x + iy) dT(x + iy) 
Im W (x + ly) = n ~~ — = — ^- — > 

ox ay 

on C + . In other words, W is a Herglotz function. 

Consider now the extended function T on C\ [— 2, 2]. By assumption, 
F(x + iy) is an even function of y G R for fixed \x\ > 2. Thus dT/dy is 
odd, and, in particular, 



dF(x + iy) 



dy 



= 0. 

y=0 



In terms of W, this says that the imaginary part of this function is zero 
on R\ [—2, 2]. Thus the associated measure is supported by [—2, 2] and 
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finite, and we again have a representation of the type (3.5). Integrate 
and take real parts to obtain that 



-Cx 



\d(x 2 -y 2 ) + B + / \n\t- z\dk{t). 

2 J [-2,2] 



Since T > 0, we must have that C = D = here, and then the 
information on the asymptotics from Definition 3.1 shows that dk is a 
probability measure. We can again write B = — In A, with A > 0, and 
(3.1) is of course automatic. 

In the remaining parts, we will not give a direct construction of 
(A,dk). Instead, we will approximate and then make use of compact- 
ness properties. More specifically, recall that we already discussed the 
case of a finite gap domain in Lemma 3.3, and we will approximate 
a general domain by these. So assume now that a G G C is given, 
let Q = G(D) be the corresponding image domain, and denote the 
associated slit height function by h. 

Let 



(3.6) h n (e ia ) = 



H n (j) a = jn/n (j = 0, 1, . . . , n) 
otherwise 



more precisely, we define h n by such a formula for < a < n and 
then extend symmetrically to the lower semicircle. Here, the H n (j) are 
defined as follows: 

(3.7) H n {j)= sup h (e^> +5 )) . 

-l/n<5<l/n 

It is then clear that the h n are slit height functions of finite gap domains 
Q n . We claim that h n — > h in the sense that the condition from part 
(b) of Lemma 3.2 holds. The argument is quite similar to what we 
did in the second part of the proof of this Lemma. Let (p G C(S 1 ), 
(p > 0. From the definition of h n , we have that if h n (e ia ) > 0, then 
h n (e ia ) = h(e t/3n ) for some f3 n = (5 n (a) with \j3 n — a\ < rc/n. Hence 

<p(e ia )h n (e ia ) = <p(e^)h(e^) + R n (a), 

and here the error R n may be estimated by the modulus of continuity 
of if: 

\Rn\ < ^(ip) = SU P \f(e^) - f(e W )\ . 

\8\<w/n,em 

Since f is uniformly continuous on S 1 , we have that ou w / n — > as 
n — > oo, and it follows that limsup(sup<^/i„) < supy^/i. 

On the other hand, sup iph is attained at some point e ta G S 1 , and, 
by construction, h n (e ll3n ) > h(e za ) at some point \j3 n — a\ < ir/n. Since 
f is continuous, this implies that liminf (sup fh n ) > sup iph. 
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Lemma 3.2 now informs us that Q n — > f2 in the sense of kernel 
convergence. By Caratheodory's Theorem [7, Theorem 15.4.10], the 
kernel convergence of the image domains is equivalent to the locally 
uniform convergence of the conformal maps G n : D — > Q n (normalized, 
as usual, by agreeing that G n (0) = 0, G' n (0) > 0), to the limit G. 

By Lemma 3.3, G n (() = e w "^ and 

(3.8) w n {z) = \nA n - J ln(t - z) dk n {t) 

for certain data (A n , dk n ) G V (we actually have much more explicit 
information on what these are, but will not use this here). We can now 
pass to a subsequence (which, for better readability, we will not make 
explicit in the notation) so that A n — >■ A and dk n — > dk in weak-* sense. 
Recall in this context that A n = G' n (0), and since G' n (0) ->■ G'(0) > 0, 
we can be sure that A > 0. The measure dk is a probability measure 
on [-2,2]. 

Taking limits in (3.8), we conclude that 

w n (z) ->■ w{z) = la A - J ln(t - z) dk(t) 

on z G C+. Thus G n (() = e w "^ ->■ e 1 "^, and it follows that G = e w . 
Put differently, G is obtained from (A, dk). Since G(D) C -D, it follows 
that Re w < 0, so (3.1) holds and (A, dk) E V. 

If a domain Q G U or a slit height function g G "H is given, we 
can define an associated conformal map G : D — > Q (with Q = Q g in 
the latter case) and then use this treatment to again produce a pair 
(A, dk) G T> that corresponds to the data that were given. □ 

The question of whether and how compact subsets of R can be ap- 
proximated by periodic spectra (that is, spectra of periodic Jacobi ma- 
trices) has received some attention, and completely satisfactory answers 
were obtained in at least three independent works. These are [3, 18, 32] 
but see also [29, Sections 5.6, 5.8] for a comprehensive discussion. In 
all four cases, the effort needed was not inconsiderable. The approx- 
imation procedure implemented above, see (3.6), (3.7), together with 
material that we will discuss in the following section, could be used to 
give a tremendously simplified treatment. 

4. Convergence of data 

Most of the data sets introduced in the previous section come with 
natural topologies. It seems reasonable to ask what the relations be- 
tween these are. 
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Theorem 4.1. Suppose that (A n ,dk n ), (A,dk) G V, and form the as- 
sociated objects, as above. Then the following are equivalent: 

(a) A n — >■ A and dk n — > dk in weak-* sense; 

(b) w n (z) — > w(z) locally uniformly on C + ; 

(c) 7 n (,z) — > 7(2) locally uniformly on C + ; 

(d) F n (() — > F(Q locally uniformly on D; 

(e) Vt n — )■ VL in the sense of kernel convergence; 

(f) sxvp(ph n — > sup iph for every ip G C(S r ), ip > 0. 

Proof. These statements are either obvious or follow from previously 
discussed material, so we can go through this quickly. Clearly, (a) 
yields pointwise convergence of the w functions, and a normal families 
argument then improves this to give the full claim of (b). Obviously, 
(b) =>- (c). If (c) is assumed and an arbitrary subsequence is chosen, 
then we can make A n — > B > and dk n — > dv in weak-* sense on 
a sub-subsequence (which is not made explicit in the notation) and 
then pass to the limit in the Thouless formula along this sequence to 
conclude that 



We now see, first of all, that B > here, and from the uniqueness 
of such representations we in fact infer that (B,du) = (A,dk). So it 
turns out that (A, dk) is the only possible limit point of the sequence 
(A n , dk n ), and from the compactness property just used we now obtain 



Next, if we recall how F was constructed from w, it is also clear that 
(b) is equivalent to the locally uniform convergence of F n to F on D + , 
which is equivalent to (d), by a normal families argument. 

We already observed earlier that the equivalence of (d) and (e) is 
exactly what Caratheodory's kernel theorem [7, Theorem 15.4.10] has 
to say in the case at hand. Finally, (e) <^=^> (f) is Lemma 3.2. □ 

These spaces V, W etc. from Definition 3.1 become compact if we 
add a degenerate object, which we can think of as corresponding to 
// ^ Mo. We will discuss this in more detail in a moment. We first 
present the analog of Theorem 4.1 for approach to this added object. 

Theorem 4.2. Let (A n , dk n ) G V, and introduce the corresponding 
objects, as in Theorem 3.1. Then the following are equivalent: 

(a) A n -)• 0; 

(b) \w n (z)\ — > 00 locally uniformly on C + ; 

(c) j n ( z ) — > 00 locally uniformly on C + ; 

(d) F n — > locally uniformly on D; 





(a). 
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(e) n n -)• {0}; 

(f) sup h n ^l. 

The condition of part (e) must be interpreted as follows: For every 
r > 0, there exists N so that D r (0) is not contained in Vt n for n > N. 
For example, Q n = D \ [1/n, 1) converges to {0} in this sense. 

In terms of Caratheodory's concept of kernel convergence, condition 
(e) states that no subsequence {Q nj } has a kernel with respect to z = 
0; see again [7, Definition 15.4.1] for background information. 

Proof. This is similar to the previous proof. It's again easy to see that 
(a) (b) <^ (c) <^ (d): Indeed, since < k < 1 on C+, (b) 

and (c) are obviously equivalent. It is also clear that (d) implies (c), 
and conversely, if (c) holds, then at least F n — > locally uniformly on 
D + , but that is enough to conclude (d) by a normal families argument 
again. Since A n = F' t (0), (d) implies (a), and (a) clearly implies (b) 
and (c). 

Obviously, (e) and (f) are equivalent, and thus we can finish the proof 
by relating (f) to one of the first four conditions. If (f) is assumed, then 
Theorem 2.6 shows that also sup_ 2<x<2 ln( x ) — > °°- Since j n (x + iy) > 
7 n (x) for y > 0, this implies that no subsequence of j n can converge 
locally uniformly to a finite harmonic limit function on C + . A normal 
families argument now gives (b). 

Conversely, if (f ) does not hold, say sup h n < c < 1 on a subsequence, 
then Theorem 2.6 shows that there is a corresponding uniform bound, 
1n{x) < C, on x G [—2, 2], along the same subsequence. So 

- In A n + J \n\t- x\ dk n (t) < C. 

Integrate both sides with respect to duo, the equilibrium measure of 
[—2, 2] (this will finish the job in a clean way, but is not really necessary; 
we could also just integrate with respect to Lebesgue measure on [-2,2]). 
Since cap [—2,2] = 1, we know that f\n\t — x\ du (x) = for quasi 
every (in fact: every) t G [—2,2]. Thus Fubini's Theorem yields the 
inequality — \nA n < C on the subsequence that was chosen above. 
This clearly prevents A n from converging to zero. We have shown that 
(a) does not hold. □ 

We would like to emphasize one point here that was already made 
implicitly in our proof of Theorem 4.1. Consider again a sequence 
(An, dk n ) G T>, which converges in the sense that A n — > B > and 
dk n — > dv in weak-* sense. There seem to be three possibilities: (i) 
(B,dv) G V also, that is, B > and (3.1) holds; (ii) B > 0, but (3.1) 
fails; (iii) B = 0. 
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It is very easy to see that (ii) does not occur. This will be used 
several times later on, so we state it separately, for easier reference. 

Lemma 4.3. Let (A n , dk n ) G V and suppose that A n — >■ B > and 
dk n — > dv in weak-* sense. Then either (B, du) G T> or B = 0. 

Proof. Suppose that B > 0. Let j n (z) G C be the Lyapunov exponents 
associated with (A n , dk n ). Then, by passing to the limit in the Thouless 
formula, 

7„(z) ->■ r(^) = - In B + J In \t - z\ dv{t) 

for z G C + , and since r y n (z) > 0, we also have that T(z) > 0. This is 
what (3.1) is asking for, so {B,dv) G V, as claimed. □ 

Finally, let us return to the topic that was already briefly mentioned 
above: We can build compact metric spaces starting from the sets T>, W 
etc. from Definition 3.1. We first introduce a metric in such a way that 
convergence with respect to this metric is equivalent to the conditions 
discussed in Theorem 4.1. These spaces are not yet compact, but we 
can pass to the one-point compactifications by adding a point at infinity 
(as the phrase goes); this extended space also admits a compatible 
metric, and approach to the point at infinity is then equivalent to the 
conditions from Theorem 4.2. 

There is, of course, general theory underlying this procedure; see, 
for example, [16]. However, we can also be explicit here and do things 
entirely by hand. Let us discuss the space V = VU{0} in this style (we 
call the added point because it is approached precisely if A n — > 0). 
We first need a metric on the finite positive Borel measures on [—2, 2] 
that generates the weak-* topology. Fix such a metric and call it D. 
Then let 

d((A, dk), {A', dk')) = \A- A'\ + D(A dk, A' dk') 

for two points from T> and d((A, dk), 0) = A + D(Adk, 0) for the dis- 
tance to added point 0; here, the second argument in D(A dk, 0) denotes 
the zero measure. 

This defines a metric d on D with the desired properties. It follows 
from Lemmas 2.3 and 4.3 and the compactness of the space of proba- 
bility Borel measures v on [—2, 2] that (V , d) is compact. Convergence 
with respect to d is equivalent to the conditions from Theorems 4.1(a) 
and 4.2(a). 

We can give similar metrics on the (one-point compactifications of 
the) other spaces from Definition 3.1. Alternatively, we can just use 
Theorem 3.1 and Theorems 4.1, 4.2 to move things over from T>q to 
those spaces. We summarize: 
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Proposition 4.4. There are metrics on the spaces V = V U {0}, 
Wo = W U {oo} etc. so that convergence with respect to the metric is 
equivalent to the corresponding statements from Theorems 4-1 and \ .2, 
respectively. These spaces are compact. 



We now come to one of the main points of the whole discussion so 
far. We also want to show that given data as in Definition 3.1, there 
exists a shift invariant measure on J 2 that produces these data. 

For the density of states measure dk, this was already shown in [5]. 
(Such a result also appears here, as Proposition 5.2.) Carmona-Kotani 
work with an approximation by periodic problems, which is very similar 
to what we did above in the approximation procedure that was based 
on (3.6), (3.7). In fact, these approximating data do come from periodic 
problems; more generally, finite gap domains yield periodic operators 
if all slits are located at angles that are rational multiples of ir. We 
cannot guarantee that this method also produces the correct A, and 
this issue will have to be addressed separately. This difficulty is directly 
related to the fact that while the density of states depends continuously 
on /i, the quantity A is, in general, only a semicontinuous function of 
H. 

Recall that M was defined as the set of invariant probability mea- 
sures on J 2 with \a.A n = f In a dfi > — oo. If /i ^ Ai , then we 
formally set A a = 0. 

Lemma 5.1. Suppose that ji n G M.q and /i n —> fj, in weak-* sense. 



In particular, [i G Mo z/limsup A Un > 0. 

The inequality can be strict. For example, if J again denotes the 
free Jacobi matrix with a = 1, b = and 



then fi n G .Mo, //„—>■// = 5j , lnA Un = —1 for all n, but lnA u = 0. 

As already mentioned above, we may rephrase by saying that the 
map fi i— > A n is an upper semicontinuous function on the (compact) 
set of invariant probability measures on J 2 . 

This Lemma is supplemented by Lemma 2.2, which says that dk Un — > 
dk,. in the situation under consideration. 



5. Existence of invariant measures 



Then 



(5.1) 



A a > lim sup A t 
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Proof. Since a limit of invariant measures is invariant itself, the final 
claim is an immediate consequence of (5.1), so it suffices to prove this 
inequality. Since J h-> ln(a (J) + e) is a continuous function on J 2 for 
fixed e > 0, we have that 

(5.2) J ln(a (J) + e) dfi n (J) ->■ J ln(a (J) + e) d^(J). 

Moreover, J ln(a + e) rf/i — > InA^ G [— oo, oo) as e — > 0+ by monotone 
convergence, so if (5.1) failed, then we could find a subsequence and 
e > so that 

J ln(ao( J) + e) dfi(J) < J lna (J) dfi n (J) — e 

along the subsequence chosen. However, the integrals on the right- 
hand side are clearly dominated by J ln(ao + e) d[u, n , so this contradicts 
(5.2). □ 

Proposition 5.2. Suppose that T E C Then there exist (j, E Aio and 
d> so that 

Y(z)= 1 ,(z) + d. 
Moreover, i/inf zeC + T(z) = 0, then necessarily d = 0. 

Here, 7 M of course refers to the Lyapunov exponent that is con- 
structed from fx G Mq as in Section 2, via (A^^dk^) and (1.3). 

Proof. This is similar to the argument we used in the proof of Theorem 
3.1 to construct (A, dk), given a conformal map G. First of all, Theorem 
3.1 provides us with associated data (A,dk), W(z), G((), Q, h. Define 
again approximating finite gap domains as in (3.6), (3.7), and denote 
the corresponding data by A n , dk n , w n etc. By Lemma 3.3, there are 
finite gap sets E n C [—2, 2] so that A n = cap E n and dk n = duE n - 

This approximation procedure is exceedingly useful here because if 
E C [—2, 2] is a finite gap set, then we can give a solution to the 
problem we set out to solve, and a fairly explicit one at that. More 
precisely, just take any ergodic measure fi that is supported by TZo(E); 
here, 1Z (E) denotes the set of Jacobi matrices J with a (J) = E that 
are reflectionless on E; these are usually called finite gap operators, 
and they have been studied very heavily. An account of the classical 
theory may be found in [31, Chapter 9], but see also [21, 24] for much 
more on the spaces TZq(E). Note that of course TZo(E) C J2 if (and 
only if) E C [—2,2]. Ergodic measures on IZo(E) exist because these 
spaces are compact and shift invariant. 

We claim that, as desired, A^ = cap E and dk^ = duE for such an 
ergodic \x on 1Z (E). To prove this, it will suffice to show that: (i) 
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7^ = almost everywhere with respect to wg; (ii) dk a is supported by 
E. Compare the final part of the proof of Lemma 3.3 for this step, or, 
better yet, see Proposition 9.2 below. 

These two properties are well known standard facts about finite gap 
operators, so we will be satisfied with just giving a brief review. First 
of all, the absolutely continuous part of the spectral measure dpo(J) 
is equivalent to XE{t) dt for every J G TZq(E), and this is immediate 
from the definition of the property of being reflectionless. See [31, 
Chapter 8] or [20, 24] for background. It follows from (the easy Ishii- 
Pastur part of) Kotani theory [12] that 7 M = (Lebesgue, hence oje) 
almost everywhere on E. Alternative arguments are available, too; for 
example, [20] has a (sketchy, admittedly) discussion of these issues at 
the end of the introduction. 

Moreover, as cr(J) = E for all J G IZo(E), the spectral measures 
Po(J) are supported by E and thus dk u , being their average, also has 
this property. 

Returning to the main argument, we now have available invariant 
measures p n G Mo that produce the (finite gap) data constructed 
above. On a suitable subsequence, which we again assume to be the 
original sequence for notational convenience, we can make the p n con- 
verge to a limiting measure p in weak-* sense. We constructed the 
approximations so that /i n — > /i in the sense of Theorem 4.1(f), so we 
also have part (a) of the Theorem and, in particular, A n — > A > 0. 
Thus Lemma 5.1 guarantees that p G Mo- Lemma 2.2 then shows 
that (on z G C + ) 

ln{z) = - In A n + j \n\t - z\dk n (t) ->■ —hiA + J In \t - z\ dk u (t). 

However, from Theorem 4.1(c) we know that 7„ also converges to T 
locally uniformly on C + . This gives the representation r = 7 M + d, 
with d = \n(A u /A). If we now recall that A = \imA n , then we can use 
Lemma 5.1 to confirm that A u > A, so d > 0, as claimed. The final 
claim is obvious from this, since 7 M > 0. □ 

This is not completely satisfactory. Of course, we would prefer to be 
able to represent T = 7^, without the shift d. To achieve this, we now 
show that we can also represent a larger function than T, and then 
take a suitable convex combination. 

Lemma 5.3. Suppose that F G C Then there exist D > and p G Mo 
so that 

V{z) = 7 „(*) - D. 
As an immediate consequence of this, we obtain the desired result. 
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Theorem 5.4. Suppose that an object as in one of the parts of Def- 
inition 3.1 is given. Then there exists a /i G A4 that generates this 
object. 

In other words, if T G C is given, there exists /J, G Af so that T = 7^, 
or if (A, dk) G V were given, then we can find \i G .Mo so that A = A^ 
and dk = dk^ and so forth. 

Assuming the Lemma, we can indeed easily establish Theorem 5.4, 
as follows. First of all, by Theorem 3.1, it suffices to discuss the case 
where a T G C is given. Proposition 5.2 now yields a /ii G Aio so that 
T = 7 Ml + c?i. If di = here, then we are done. If d± > 0, use Lemma 
5.3 to find /i 2 G A4o an d d 2 > so that T = 7 M2 — d 2 . Then 



d\ + rf; 



2 



also lies in A^o and satisfies 7^ = T, as desired. So it only remains to 
prove Lemma 5.3. 

Proof of Lemma 5.3. By Theorem 3.1, we can write 
T(z) = -\nA+ ln\t- z\dk(t), 

J[-2,2] 

for some (A, dk) G V. Partition [—2, 2] into 2N intervals Ij of length 
2/N each, ignore those Ij with Cj := J T dk(t) = 0, and let 

dkj(t) = -Xi.it) dk(t) 

for the remaining intervals. Then we can recover dk as the convex 
combination dk = ^Cjdkj, and the dkj are themselves admissible 
density of states measures because the integrals f\n\t — z\ dkj(t) are 
still bounded below. 

So we can define Aj > by writing 

(5.3) In Aj = inf / In \t - x\ dkj(t); 

x£R J 

then (Aj,dkj) G V, or, equivalently, Tj G C, where 

rj(z) = - In ^ + J In |t - z| dkj{t). 

By construction, these new functions all satisfy inf Tj = 0. Therefore, 
Proposition 5.2 provides us with measures \ij G A^o so that Tj = 7^ . 
Let n = ^CjfXj, and also observe that InAj < — \nN; indeed, it 
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suffices to take x as the center of Ij in (5.3) to confirm this. We have 
that 

7 M = CjTj = r + In A - ^ cj In Aj = V + D, 
and here we can be sure that 

D = In A - ^ Cj In Aj > In A + In N ■ ^ cj = In A + In iV 

will be indeed positive, provided we took iVeN large enough. □ 

6. Slits and gaps 

Recall the definitions made in the context of Theorem 2.6: Let E = 
top supp dk be the topological (= smallest closed) support of dk. E 
is a compact subset of [—2,2] (with no isolated points), and thus its 
complement (—2,2) \ E is a disjoint union of open intervals Ij, which 
we call gaps. On each gap t G Ij, the function k(t) = Jj_ 2t i dk(s) has a 
constant value kj G [0, 1], which is unique to this gap. We call kj the 
gap label of Ij. 

It is worth pointing out that k = is a gap label in this sense if 
and only if mini? > —2; the corresponding gap is the missing piece 
(— 2, mini?). A similar comment applies to ko = 1 as a gap label. 

We mention in passing that there is an interesting and beautiful 
theory (the Gap Labeling Theorem) that describes the set of possible 
gap labels in terms of the dynamics of the shift map S on top supp \x. 
See, for example, [10, 27] for the classical results and [1] for a recent 
development. 

We saw earlier that if E is a finite gap set, then the gap labels 
correspond exactly to the slits of f2. More precisely, f2 is the unit disk 
with finitely many radial slits removed, and these slits are located at 
the angles e ±l7rfcj , with kj being the gap labels. See Lemma 3.3 and its 
proof for these statements. 

This correspondence between slits and gaps is valid in general, if we 
define the notion of a slit for a general region Q G 1Z appropriately. 

Definition 6.1. Let fl G 1Z, and let h G H be the associated slit height 
function. We say that has a slit at angle e ta if 

h (e za ) > limsup 

h ( e i(a+at)\ 

for at least one of ff = 1 or c = - 1. 

So a slit, in this technical sense, corresponds to an at least one-sided 
jump in the slit height function. 

Theorem 6.1. Let Q G 1Z and < a < ir. Then Q has a slit at angle 
e ia if and only if k = a/n is a gap label of E = top supp dk. 
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Proof. Suppose first that ko G [0, 1] is the label of some gap (a, b), with 
— 2 < a < b < 2. This means that k(t) = k for t G [a, b], but also that 
k(t) 7^ k if t G [—2, 2] \ [a, b\. In this situation, Theorem 2.6 says that 
/ l ( e <T*o) = l_ e -r r = sup a < t < b7 (t). 

The Thouless formula shows that 7 has a harmonic extension to 
C\E, and 

for £ G (a, b). It also follows, with the help of monotone convergence, 
that 7L „ is continuous. So, in particular, at least one of the inequal- 
ities T > 7(a) or T > 7(6) holds. Let's assume that T > 7(a) and also 
that a > — 2 (if a = —2, then T > 7(6) and b < 2, and an analogous 
argument works). Then ^(t) < T — e for all a — e < £ < a for some small 
e > because 7 is upper semi continuous. Now k(a — e) < fc(a) = fc , 
so Theorem 2.6 implies that h(e tnk ) < h(e l7Tk °) — S for some S > and 
all k < k that are sufficiently close to k . This is what we wanted to 
show. 

To prove the converse, we again use Caratheodory's theory of the 
boundary values of conformal maps. Assume that 

(6.1) limsup/i (e i7Tk ) < h (e i7Tk °) = h , 

k—tko— 

the other case being analogous, of course. In more geometric terms, 
assumption (6.1) means that dflh contains an exposed line segment 

(6.2) S = {re inh> : 1 - h + e < r < 1 - h + 2e} 

that can be accessed from through smaller angles. Or, more for- 
mally, we can choose e > so small that also Q C f^, where 

Q = \je ia : 1 - h + e < r < 1 - h + 2e, nk - e < a < 7rk } . 

As a consequence, each point on S from (6.2) corresponds to a different 
prime end. Let us try to say this in more precise language: If z n 
is a sequence of points from Q that converges (in traditional sense) 
to some z G S, then z n , viewed as a sequence from Qh, the union 
of Qh with its prime ends, with the topology discussed in [7, Section 
14.3], converges to some prime end. (This is easy to show, but for 
our purposes here, convergence on a subsequence is enough, and this is 
automatic because flh is compact.) Moreover, and this is actually the 
crucial part, if z 7^ z', then the corresponding prime ends are different 
also. This follows immediately from the way prime ends were defined. 
Finally, recall again [7, Theorem 14.3.4], which says that F extends to 
a homeomorphism F : D — > 0,^. 
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The upshot of all this is the following: We can find two sequences 
Cn, Cn e ^ which converge to two different boundary points (, (' G dD, 
so that -F(Cn), ^(O both converge to points on 5 from (6.2). We 
obtain these sequences by simply picking sequences z n , z' n G Q so that 
z n z, z' n ^ z', and here z, z' are two distinct points from S. We 
then let ( n = F~\z n ), (' n = F~\z' n ). 

In fact, we can and must say slightly more here: Since the z n , z' n can 
all be chosen from the same semidisk (either D + or D~), it is also true 
that (, (' will either both be on the (closed) upper semicircle, or they 
will both be on the lower semicircle. 

If we now go back to the original variables and recall that k(z) is 
continuous on C + U R (see Proposition 2.1(a)), then this says that 
there are t, t' G [—2, 2], t ^ t', with k(t) = k(t') = k . Thus k is a gap 
label. □ 

Tools from the classical theory of conformal maps can be used to 
analyze other questions, too. For example, [7, Theorem 14.5.5] says 
that F : D — >■ Vt has a continuous extension F : D — > f2 if and only 
if dQ is locally connected. Note that we are now seeking an extension 
that takes values in C, so this issue is not directly addressed by the 
theory of prime ends. This result may be used to establish the following 
criterion for the continuity of the Lyapunov exponent. 

Theorem 6.2. Let 7 G C, and let h G H be the associated slit height 
function. Then 7(2) is continuous on C if and only if the following 
holds: (i) If a/n is not a gap label, then h is continuous at e za ; (ii) if 
a/n is a gap label, then hra t _, 0+ h(e l( - a+a ~^) exists for both a = 1 and 
a = -l. 

This can be proved by verifying that dVth is locally connected if and 
only if (i), (ii) hold. Note that as k(z) is always continuous on C + UR, 
the conformal map w has a continuous extension to this set if and only 
if 7 has this property (and in this case, 7 extends continuously to all of 
C, by the Thouless formula). Also, this condition is of course equivalent 
to the possibility of extending F continuously to D. Having made these 
remarks, we omit the detailed proof of Theorem 6.2. An alternative, 
more direct proof that is based on Theorem 2.6 is also possible. 

7. More on Lyapunov exponents 

In this section, we discuss 7(2;) as a function on x G [—2, 2]. Potential 
theory implies that if 71 (x) = 72(2;) for quasi every (that is, off a set 
of capacity zero) such x, then 7! = 72. See [26, Section 1.3]. So this 
restriction of 7 to [—2, 2] still contains all the information. We do not 
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have a description of the set of all these functions, but we are able to 
offer the following statements, which supplement Theorems 4.1, 4.2. 

Theorem 7.1. Let 7„,7 G C Then the following conditions are also 

equivalent to those from Theorem 

(a) 

(7.1) sup ip(x)^f(x) = lim sup <p(x)~f n (x) 

-2<x<2 n ^°° -2<x<2 

for all ip G C[-2,2], <p > 0. 

f&j TTie 7„(x) (n > 1, — 2 < x < 2) are uniformly bounded, and if 
v G V (defined below), then 

(7.2) lim / Yi{x) -7„(x)| dv(a;) = 0. 

Here, we let P be the set of probability measures v on the Borel sets 
of [—2, 2] for which the potential 

(7.3) § v (x) = / \n\t-x\ du(t) 

is a continuous function of x G R. This in particular forces v to give 
zero weight to all sets of capacity zero. On the other hand, for any 
compact K C [—2, 2] of positive capacity, there exists a v G V with 
u(K c ) = 0. See [26, Corollary 1.6.11]. So, in some vague sense, one can 
perhaps say that the class V is equivalent to capacity. 

There are limits to this, however. More specifically, while the T x {y) 
convergence from (b) of course implies convergence in measure with 
respect to every v G "P, that is, 

(7.4) i/(|7 - 7„| > e) ->■ for every e > 0, 

we are not claiming that the capacity of the set where |7„ — 7| > e 
approaches zero, and indeed this latter statement is false. A coun- 
terexample may be constructed by approximating a positive 7 G C, 
say j(x) = 1 on [—2,2], by a sequence of 7„'s corresponding to finite 
gap sets E n , as in the proof of Theorem 3.1 (compare (3.6), (3.7)). 
Lemma 3.3 then shows that 

cap ({x G [-2, 2] : 7„(x) = 0}) = cap E n = A n . 

By construction, the A n approach the positive limit A = F'(0), where 
F G C is the conformal map associated with 7 (so if 7 = 1, then 
F(C) — e_1 Cj Du t we don't need to know this here). 
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Theorem 7.2. Let^ n e C. Then the conditions from Theorem 4-2 are 

equivalent to: 

(a) 

(7.5) lim sup 7„(x) = oo. 

n ^°°~2<x<2 

(b)IfueV, then 



lim / 1n{x) dv(x) = oo. 

«-^°°J[_2,2] 



Since (7.1) and (7.5) are analogous to conditions (f) from Theorems 
4.1 and 4.2, respectively, and, moreover, 7 and h are directly related 
through changes of variables (and a partial maximization), as spelled 
out in Theorem 2.6, it seems tempting to try to relate these directly. 
We are going to give a different, more indirect argument, however, 
which seems easier and more convenient. 

Proof of Theorem 7.2. We start with this because we will use Theo- 
rem 7.2 in our proof of Theorem 7.1. The equivalence of (a) with the 
conditions of Theorem 4.2 is an immediate consequence of Theorem 
2.6, which in particular implies that for any 7 G C, the associated slit 
height function satisfies 



sup h(e ia ) = 1 — exp I — sup 7(2;) 

0<a<n ' V ~2<x<2 

So (7.5) holds if and only if sup h n — > 1, which is condition (f) from 
Theorem 4.2. 

Next, assume that A n — >■ (this is (a) of Theorem 4.2). We want 
to derive (b) from this. Integrate the Thouless formula with respect to 
dv. With the help Fubini's Theorem, this gives 

/ j n (x)du(x) = -\nA n + / § v (t)dk n (t). 

7 [-2,2] J[-2,2] 

Here, § u is continuous by assumption, hence bounded, and thus the 
integrals on the right-hand side stay bounded, and (b) follows. 

Finally, if (b) is assumed, then (a) follows trivially. □ 

In the next proof, we will make repeated use of two fundamental 
potential theoretic results, the lower envelope theorem and the principle 
of descent. We will state them here, but please refer to [26, Theorems 
1.6.8, 1.6.9] for a fuller discussion. 

Suppose that dk n — > dv in weak-* sense. Then 

& v (x) = lim sup 3> n (x) 
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for quasi every x G [—2,2] (the lower envelope theorem). Here, the 
logarithmic potential <3>„ of a measure v is again defined by (7.3), and 
we of course further abbreviated $ n = §dk„- 

This is supplemented by the principle of descent, which says that 

$ v (z) > limsup$ n (,2) 

n— ¥oo 

for all z G C. Again, this is interesting for z = x £ [—2,2]. On 
the complement of this set, the stronger property of locally uniform 
convergence is obvious. 

Proof of Theorem 7.1. We first show that the conditions of Theorem 
4.1 imply (a). Let tp G C[— 2,2], tp > be given. As in the proof of 
Lemma 3.2, we will split (7.1) into two inequalities. We first show that 

(7.6) sup y?7 > lim sup (sup y?7n) • 

n— s-oo 

Since the functions y?7« are upper semicontinuous, the suprema are 
maxima, so if (7.6) were wrong, we would find ourselves in the following 
situation: 

(7.7) sup y?7 < tp(x n )>y n (x n ) - e, 

for all n from a suitable subsequence and certain points x n G [—2,2], 
and here can also assume that x n — > x G [—2, 2] along that same 
sequence. Let dv n be a shifted version of dk n ; more precisely, 

J f(t) du n (t) = J f(t + x-x n )dk n (t) 

for / G C(R). Notice that § Un {x) = ^ d k n (x n ). By (a) of Theorem 
4.1, dk n — > dk in weak-* sense and thus also dv n — > dk along the 
subsequence that was chosen above. Since, furthermore, A n — > A, the 
principle of descent now says that 

7(x) > limsup7„(a;„) 

(the lim sup is taken along some subsequence, but this is irrelevant 
here). Since •p is continuous, this contradicts (7.7), unless <p(x) = 
0. However, if <p(x) = 0, then (7.7) implies that 7„(a; n ) — > oo, and 
we again obtain a contradiction, this time to Theorem 7.2. We have 
established (7.6). 

Next, we show that also 

(7.8) sup </?7 < lim inf (sup y?7n) , 

n^oo 

and this together with (7.6) will of course establish (7.1). Again, we ar- 
gue by contradiction. If (7.8) failed, then we would find a subsequence 
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(p(x)-y(x) > (p(t)-y n (t) + 2e 



for all t G [—2, 2] and all n from that sequence. We can now use 
the fact that 7 is continuous with respect to the fine topology and 
slightly change x to obtain another inequality of this type (with 2e 
replaced by e, say), where we can now also guarantee that x is not 
from the exceptional capacity zero set from the lower envelope theorem. 
Thus 7(2;) = limsup7„(a;). Here, the limsup is taken along the same 
subsequence that was singled out above (this is important); in other 
words, we applied the lower envelope theorem to this subsequence and 
not to the original sequence. We obtain a contradiction to (7.9) with 
t — x. 

To prove that, conversely, (a) above implies part (a) from Theorem 
4.1, we again exploit the compactness properties that were discussed 
in Sections 4. Suppose that (7.1) holds. We can pass to a subsequence 
so that A n — > B, dk n — > dv. Here, by Lemma 4.3, either B = or 
(B, dv) G T>. The first case is impossible because then Theorem 7.2 
would imply that (7.5) holds on the subsequence we chose, but this is 
clearly incompatible with our assumption that we have (7.1). 

So (B, dv) G V, but then, by what we showed already, 

(7. 10) lim (sup y?7„) = sup WIPM 

along the subsequence constructed, for all <p G C[— 2,2], ip > 0. How- 
ever, limits in this sense are unique. In other words, if 7, 7 G C are not 
the same function, then 

(7.11) sup <p(x)~f(x) ^ sup (p(x)'y(x) 



for some nonnegative (p G C[— 2,2]. Indeed, if 7(2:0) < 7(^0), say, for 
some xq G [—2,2], then, as 7 is upper semicontinuous, we in fact have 
that 7(2;) < 7(2:0) — e for all x from some neighborhood of xq also, so we 
can simply take a <p that is supported by this neighborhood, < ip < 1, 
and (p(x ) = 1, and we are then guaranteed that (7.11) holds. 

This uniqueness means that (7.10) forces "f(B,dv) to be the function 7 
from (7.1), and thus, by the uniqueness part of Theorem 3.1, (B, dv) = 
(A,dk), the data associated with 7. So this is the only possible limit 
point of the sequence (A n , dk n ), but any subsequence has a limit point, 
thus the whole sequence has to approach this limit, and this is condition 
(a) from Theorem 4.1. 

Next, we again assume the conditions from Theorem 4.1, and we now 
wish to establish (b). First of all, we certainly have that 7 n (^) < C 
for all n, x and some uniform bound C. We have already shown that 



-2<x<2 



-2<x<2 
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(7.1) holds under the present assumptions, so we can now obtain this 
uniform bound very conveniently by just taking <p = 1 in this condition. 

So we can focus on (7.2). Fix a v G V. We will show that j n — > 7 
in measure, that is, (7.4) holds. This is sufficient because, as just 
discussed, < 7„,7 < C, so L l (v) convergence will follow from this. 

We will argue by contradiction and thus assume hypothetically that 
(7.4) fails. Then there exists e > so that 

(7.12) KIT - 7n| > e) > e 

for all n taken from some subsequence. 

Recall that 7(2;) > limsup7„(x) for all x by the principle of descent. 
So if we are given an r\ > 0, we can find an integer N = N(x, rj) so that 

ln{ x ) < l{ x ) + V f° r ah n > N. 

We can also choose these integers N(x, rj) as a measurable function 
of x G [-2,2]. Then v(N > N ) ->• as N ->• 00 by monotone 
convergence, so we can in fact find a (constant) integer N and an 
exceptional set £ C [—2, 2] with v(£) < rj so that 

ln{x) < 7(x) + 77 

whenever n > N and x ^ £. If we take 77 < e/2, say, then (7.12) now 
has the more specific consequence that 

e 

K7 - In > e) > - 

for all n > N from the sequence that was determined earlier. Abbre- 
viate 

S n = {x G [-2, 2] : ln {x) < ^x) - e}; 
then, as just observed, u(S n ) > e/2 for these n. It follows that 

/ j n (x) dv{x) = / 7 n (x)dv(x)+ / 7„(x) di/(ar) 

J [-2,2] JS n JS^ 

< / 7(x) dv{x) - %- + / 7 n (x) 

/ 7 (x) di/(x) + (C + 1)17 - ^ 
Vf-2,21 1 



< 



To obtain the last line, we further split into two parts. On fl S c , 
we have the inequality j n < 7 + rj, so this part of the integral may be 
estimated by f qc 7 <iz/ + 77, and on fl £, we just use that 7„ < C and 

1/(5) < v- 
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If we took i] > so small that (C + 1)?7 < e 2 /2, then this says 
that j ' ^ n du < J 7 du — 5 for some 5 > and all n from a cer- 
tain subsequence. This is impossible because we can also show that 
f 7„ du — > J 7 du. This is done as above, by integrating the Thouless 
formula and using Fubini's Theorem: 

/ 7 n (x) du(x) = - In A n + / dk n (t) 

J [-2,2] J[-2,2] 

-> - \nA+ / $ v (t)dk(t) 

J [-2,2] 

= / 7(x)dis(x), 

J [-2,2] 

because A n — > A > and dk n — >■ rf/c in weak-* sense by assumption, 
and, also by assumption, $^ is a continuous function. This contradic- 
tion proves (7.4). 

Conversely, if (b) is assumed, we repeat the argument from above: 
Consider any subsequence on which A n — > B > 0, dk n — > dp. We want 
to show that then necessarily B = A > 0, dp = dk, where (A, dk) G T> 
are the data of 7. As above, B = is impossible because then Theo- 
rem 7.2(b) would apply on the corresponding subsequence, and this is 
incompatible with our assumption that (7.2) holds. So (B, dp) e V by 
Lemma 4.3. As a consequence, by what we showed already, 7„ — >■ l(B,d P ) 
in L 1 (^) along the corresponding subsequence. Thus 7(B,dp)(a ; ) = t(^) 
almost everywhere with respect to u for all u e V. This implies that 
l(B,dp){. x ) = 7(2;) for quasi every x G [—2,2] because, as we reviewed 
above, any positive capacity set admits a measure u G V that is sup- 
ported by it. We conclude that 7(B,dp) = 7 are th e same function, thus 
(B, dp) = (A, dk) by the uniqueness part of Theorem 3.1. □ 

8. Positive Lyapunov exponents 

In this section, we present a variation on a theme composed by Avila 
and Damanik [2] . These authors show that if an ergodic system is fixed 
and factors (= homomorphic images) are considered, then generically 
the Lyapunov exponent is positive Lebesgue almost everywhere, with 
respect to a natural topology. 

The material discussed in this paper provides a very natural approach 
to these issues. The key fact is the following consequence of Theorem 
7.1(b). 

Lemma 8.1. Let u eV. For any a, b > 0, the set 

S(a,b) = {7 G C : 1/(7 < a) > b} 
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is a closed subset of (the metric space) C. 

Here, we again use the customary self-explanatory notation where a 
condition is used to denote the set it defines. 

Proof. Let v G V. Suppose that 7„ G S(a,b), 7 G £, 7„ — > 7 in the 
sense of Theorem 4.1(c) or one of the equivalent descriptions of this 
mode of convergence. 

Given e > 0, no matter how small, Theorem 7.1(b), or rather its 
consequence (7.4), lets us find an integer N and an exceptional set 
£ C [-2,2], such that v{E) < e and \-f N (x) - 7(2;) | < e if -2 < x < 2, 
x £. Since 7^ G S(a, b) by assumption, this implies that 

1/(7 < a + e) > 6 — e. 

With the help of the monotone convergence theorem, one can now 
check that this condition for arbitrary e > implies that 7 G S(a,b), 
as desired. □ 

The Lemma can be rephrased, as follows: The function 7 h-> 2/(7 < a) 
is upper semicontinuous. Compare this formulation with [2, Lemma 1]. 

Corollary 8.2. Let v eV. Then the set 

{7 G C : 7(rr) > for //-almost every 

is a dense G$ subset of the compact metric space £ . 

Recall that Co was defined as the one-point compactification of £; 
please review Proposition 4.4 and its discussion in this context. 

The Corollary has further implications because, by the classical Kotani 
theory [12], absolutely continuous spectrum for ergodic systems corre- 
sponds to zero Lyapunov exponents. See [2] for these aspects of the 
Corollary. 

Proof. By Lemma 8.1, the sets 

U(a, b) = S(a, b) c = {7 G C : 1/(7 > a) > 1 - b} 

are open in C and thus also in £ - Monotone convergence shows that 
1^(7 > 0) = lim a ^ + ^(7 > a), so the set from the Corollary may be 
represented as follows 

f]\JU(a,l/n); 

n>l a>0 

it is a countable intersection of open sets, as claimed. It is also dense 
because for any 7(2) G £, we have that 7(2) + 1/n G C also, and this 
sequence converges to 7(2) in C (Approximation of 7 = 00 by members 
of the set from the Corollary is of course a trivial assignment.) □ 
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9. Ergodic measures 



Return to the discussion of Section 5. We are given a T G C (or other 
data with the properties from Definition 3.1), and we constructed an 
invariant measure \i G Alo so that T = 7 M . We cannot guarantee 
that /i will be ergodic here (even if Proposition 5.2 already provides 
the correct \i and we choose the approximating measures fi n as ergodic 
measures, really nothing has been achieved because a limit of ergodic 
measures need not be ergodic itself). It is natural to ask if it is also 
possible to find an ergodic /i so that T = 7 M . 

Unfortunately, we don't have anything substantially new to say on 
this interesting question. Basically, we will review and put into context 
some observations made by Kotani in [13], and then point out some 
obvious open questions. 

Proposition 9.1. Suppose that T G C is an extreme point of the convex 
set C Then there exists an ergodic measure ji G Alo so that T = 7 M . 

This does not come as a big suprise. Ergodic measures are precisely 
the extreme points of the set of invariant measures, so one would expect 
extreme points to play a role here. The converse of Proposition 9.1 is 
false, however. A counterexample is provided by any ergodic model 
whose Lyapunov exponent satisfies 7 > c > 0. This behavior has 
been established for the Lyapunov exponent of the Almost Mathieu 
operator for large coupling [4] (in fact, Bourgain-Jitomirskaya compute 
the Lyapunov exponent exactly). Such a Lyapunov exponent is not an 
extreme point of C, for the simple reason that 7 ± c G C also, and of 
course 7 = |(7 + c + 7 — c). 

Proof. Suppose that r G C is an extreme point, and let ji G Afo be 
an invariant measure so that Y = 7^. We now use Choquet theory 
(see [19], especially Sections 3 and 12 of this reference) to decompose 
/i = J isda(is) into ergodic measures v on the Borel sets of J 2 . This 
means that 



for all bounded Borel functions /. Choquet 's Theorem says that there 
is such a measure da, with the following additional properties: it is a 
probability measure on the Borel sets of the space Af of invariant prob- 
ability measures on (the Borel sets of) J2 (with the topology induced 
by the weak-* topology of the regular Borel measures on J 2 , viewed as 
the dual of C(^J 2 ))- Moreover, and this is crucial, da is supported by 
the subset of ergodic measures. 
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We claim that we then also have that 

(9.1) 7 „(z)= / lv (z)da{v) 

Jm 

for z G C + . Indeed, if we set 

L n {J) = max{lna (J), — n}, 

say, then monotone convergence, applied a total of three times, shows 
that 

\nA fl = / In do (J) dfi(J) = lim / L n (J) dfi(J) 
Jj 2 ' n ^°°Jj 2 

= lim / da{v) [ dv{J)L n {J) 
n ^°° Jm hi 

= / da(v) lim / dv(J) L n (J) = \ \n A u da(v). 
Jm n ->°° Jj 2 Jm 

(This also shows that da is supported by .Mo-) Furthermore, by 
just chasing definitions, we can also easily confirm that J f dk^ = 
J da(u) f dk u f for continuous /, so we do obtain (9.1) by integrat- 
ing the Thouless formula for j u with respect to da. 

Now 7^ is an extreme point by assumption, so if M C Ai is any 
Borel subset, then necessarily § M ^ v da = a(M)j^ also. In particular, 
sets of the type 

M z , e = {veM : ~{ v {z) > llx {z) + e} , 

with z G C + , e > all satisfy a(M Zj€ ) = 0, and of course the same goes 
for sets defined by an inequality of the form j v (z) < 7 M (z) — e. Thus, 
by taking a suitable countable union, we see that ^ v = 7 M for a-almost 
every v G M.q. As pointed out above, almost all of these measures v 
are also ergodic. □ 

So it would be interesting to know what the extreme points of £ 
are. As observed above, 7 is not an extreme point if inf 7 > 0. At the 
other end of the spectrum, we have the following statement, which we 
adapted from [13, Theorem 6.3] and its proof. 

Proposition 9.2. Let (A, dk) G V, and let 7 G C be the corresponding 
Lyapunov exponent. Write E = top supp dk C [—2,2]. Suppose that 
one of the following equivalent conditions holds: 

(a) A = cap E, dk = duE; 

(b) 7(t) = for quasi every t G E; 

(c) j(t) = for UE-almost every t G E. 
Then 7 is an extreme point of C 
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So here we assume that 7 = essentially everywhere where this 
function can be equal to zero. Thus there is a huge gap between the 
Proposition and our first observation that 7 is not an extreme point if 
7 > c > everywhere. 

Proof. The equivalence of (a)-(c) follows from a routine application 
of potential theoretic tools; compare, for example, [28]. We sketch 
the argument here for the reader's convenience. First of all, if (a) 
is assumed, then what (b) asserts is known as Frostman's Theorem 
[22, Theorem 3.3.4]. Next, (b) clearly implies (c) since u E gives zero 
weight to all sets of capacity zero. If (c) holds, then we can integrate 
the Thouless formula with respect to oje and use Fubini's theorem to 
obtain that 

= -]nA+ <& UE (t) dk{t) = ln(cap E/A). 

J [-2,2] 

The last step again depends on Frostman's Theorem. So we indeed 
have that A = cap E. On the other hand, we may also integrate with 
respect to dk, and we then obtain that 

I(dk)= / dk{t) I dk{x) \n\t-x\ > In A 

J [-2,2] J [-2,2] 

The equilibrium measure oje may be characterized as the measure that 
maximizes I among all probability measures supported by E, and this 
maximum value equals I{uje) = In cap E. Thus it now follows that 
dk = duE, and we have obtained (a). 

Such a 7 clearly is an extreme point. Indeed, if 7 = 5(71 + 72), 
then, by Theorem 3.1, we must also have that dk = \{dk\ + dk?), 
so, in particular, Ei,E 2 C E and hence 7j = quasi everywhere on 
Ej also. As we just saw, this property identifies dkj = dojE 3 as the 
corresponding equilibrium measures. As 7^ > on E c -, it in fact follows 
that E 1 = E 2 = E and thus 71 = 72 = 7- □ 

This provides a class of examples where ergodic measures can always 
be found. We do not know if there are any r G C that do not admit 
ergodic measures for their representation. Note also that a certain 
subclass of the examples discussed in Proposition 9.2 has the much 
stronger property that every /i G M with T = 7^ is ergodic (which 
also means that there is only one such /i because otherwise we could 
take convex combinations to obtain non-ergodic /x's). This happens 
when E is a finite gap set with rationally independent gap labels (this 
is classical and follows from an analysis of the shift on these spaces; 
see [31, Chapter 9]), but also for certain sets E with infinitely many 
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gaps and this property (we know this thanks to work of Sodin-Yuditskii 
[30]). It is not clear if there are other examples of Lyapunov exponents 
T with this property that there is only one (ergodic) \i with T = 7^. 

10. Invariance under Toda maps 

In this final section, we show that w is invariant under maps of Toda 
type. We will give a simple abstract version of this result, which, at 
the same time, will also be more general. It is not necessary here to 
be familiar with the theory of Toda flows. We consider continuous 
maps ip : Ji — > J2 that preserve the shift dynamics in the sense that 
Sip = ipS. This also makes sure that the induced map /i ip/i on 
the probability measures on the Borel sets of J2 preserves the property 
of being an invariant measure; recall in this context that the image 
measure <pfi is defined by the condition that J f d(ip/j) = J f o tpdfi 
for continuous functions /. The invariance of ip/i is most elegantly 
established by observing that a measure is S invariant precisely if it 
coincides with its image measure under S. Now the fact that S and p 
commute implies that similarly Sip/J, = ipS/j,, and this latter measure 
equals ip/j, by the invariance of /i. 

Theorem 10.1. Suppose that p : J 2 — > J2 is a bijective continuous 
transformation that commutes with the shift, Sip = ipS, and preserves 
spectra: a(ip(J)) = cr(J). Then ip/i G Aio f or every fj, G Aio and 
w w = Wp. 

As alluded to above, the time one map of any Toda flow has these 
properties; please see [31, Chapter 12] and [25] for background. In 
particular, there is a reasonably large supply of such maps. Note, 
however, that while (classical) Toda flows act by unitary conjugation, 
we are not assuming here that ip (J) and J are unitarily equivalent; the 
spectra are only conserved as sets. 

The invariance of w under (genuine) Toda flows was established ear- 
lier by Knill [11]; see especially Theorem 5.1 of [11]. 

Proof. First of all, it suffices to prove this for ergodic measures \x G .Mo- 
To see this, we again use the ergodic decomposition of an invariant 
measure \i that was discussed in the previous section (see the proof of 
Proposition 9.1). So write /j, — J vda{v). It follows directly from the 
definitions that then similarly ipfj, = j ipv da(v), and the measures ipv, 
as well as the measures v themselves, are ergodic cr-almost everywhere. 
So if we can show the invariance of w u under ip for ergodic u, then we 
will obtain the general case from (9.1). A similar argument is possible 
concerning the claim that ip/i G M . 
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Given an ergodic p G M.q, we will first approximate it by the mea- 
sures p t = F e p. Here, we use the same notation as in the proof of 
Proposition 2.1; see (2.4) and the discussion that follows. Note that 
the p e are also ergodic, and recall that 7^,(2) — > 7 M (z) as e — > 0+, for 
z G C + . 

We will then approximate these measures p e by periodic measures; 
here, we call a probability measure p on J 2 periodic if it is of the form 




for some J G J2 with S P J = J. We formulate this step as a separate 
Lemma. 

Lemma 10.2. Let p be an ergodic measure on J 2 . Then there are 
periodic measures p n so that p n — >■ p in weak-* sense. Moreover, if p 
is supported by J^ ' then the p n can be chosen so that they also have 
this property. 

Proof of Lemma 10.2. The ergodic theorem says that if / G C^J-i) is 
given, then 

\im - J2 f(S J J) = [ f(J)dp(J) 

for /i-almost every choice of J. Since C{J-i) is separable, this implies 
that also 

1 P 

(10.1) p= lim -Y^5 S3 j 

i=i 

in weak-* sense for /i-almost every J. Fix such a J, and consider 
periodic modifications J' of J. More precisely, we let J' have the same 
coefficients as J on n = 1, 2, . . . ,p for some p > 1, and then continue 
periodically. In other words, (a',b') n = (a,b) n for n = l,2,...,p, 
and the remaining coefficients are obtained from the condition that 
(a', b') n+p = (a', b') n for all n G Z. 

Recall now how the metric d on J2 was defined; see (1.1). Since 
a, 6 G it follows that we can find a constant C so that 

d(S j J, S 3 J') < C2- pl/2 for p 1/2 <j<p- p 1/2 . 

Indeed, if j is from this range, then the coefficients of S^J and S^J' 
agree on an interval centered at of size at least p 1 ^ 2 , and the estimate 
follows at once from this. If we now replace J with J' = J' p in (10.1), 
then the periodic measures obtained in this way will still converge to p 
because S j J' will be uniformly close to S j J for the lion's share of the 
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sum, and because of the factor 1/p, the remaining p 1 / 2 summands 
cannot make an appreciable contribution. 

This procedure also establishes the final claim because all J' will be 

in j!f* if J was from this subspace. □ 

Apply this to the measures p e . We obtain periodic measures p„ )e — > 
fi e . Now for a periodic measure p, we certainly have that 7 W = 7 p . 
This follows because if p = (1/p) then ipp = (1/p) ^S^gUi but 

since <p commutes with S, this is again a periodic measure, and it is 
formed with the periodic Jacobi matrix <p(J). For a periodic operator 
J and the associated measure p, the corresponding data are (A p , dk p ) = 
(cap E,cLue), where E = u{J) (compare also our discussion of finite 
gap domains in this context). So they only depend on the spectrum of 
J, but we assumed that <p preserves this. 

Thus 7 w „ i£ = 7^ . We now send n — > oo. Since the measures p„ ie 

are all supported by J^f* , we can be sure that 7^ — > 7 Me ; compare 
again the proof of Proposition 2.1 for this step. There is no such 
additional information available for the measures <PAV«> so we will just 
use Lemmas 2.2 and 5.1 here. Notice that we do know that <pp„ i(E — > y?p e 
as n — > oo. It follows that 

with c e > 0. If we now also take e — > 0+, then, as just explained, the 
left-hand side will converge to 7 M . On the right-hand side, we again 
refer to Lemmas 2.2 and 5.1 to conclude that 

(10.2) 7m(*)=7w*(*)+c ( c >°)- 

In particular, we have learnt from this argument that — In A Ws stays 

bounded as e — > 0+, so Lemma 5.1 does make sure that ip/i G A4 , as 

claimed. 

To obtain the full assertion of the Theorem, all that remains to be 
done is to let ip/i and p = (p _1 ((pp) swap roles. So only c = is 
possible in (10.2). Since 7 and w determine each other, the claim may 
be phrased in terms of w, which is what we did in the formulation of 
the theorem. □ 
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